COMMUNICATION SYSTEM,
INFORMATION PROCESSING APPARATUS 
AND METHOD, AND STORAGE MEDIUM 

BACKGROUND OF THE INVENTION

Field of the Invention 

The present invention relates to a communication
system, an information processing apparatus and an
information processing method which can communicate an
image or a voice, and a storage medium which stores
this method.

Related Background Art

In recent years, it has become possible to transmit
and receive (i.e., manage) an image and a voice among
plural information processing apparatuses through a
communication line. For example, the image and the
voice can be transmitted and received among plural
personal computers connected to the communication line,
by using the Internet or the like.

In such a case of managing image data, it is
necessary to reduce the data amount as much as possible.
To do so, when moving image data (or dynamic
image data) is managed, a method of reducing the frame
rate has been known. Also, a technique has been known
to compress the moving image data and still image data
themselves by using an MPEG (Moving Picture Experts
Group) system, a JPEG (Joint Photographic Experts Group)
system and the like.

Conventionally, when a reception-side apparatus
receives the image and the voice transmitted from a
transmission side, the reception-side apparatus
causes a monitor to display the image and causes a
speaker to output the voice, whereby the contents of the
image and the voice are recognized by an
operator. Therefore, the operator has had to determine the
importance of the image and the voice by confirming
their contents as the occasion arose.

Further, the operator has had to set the quality of
the image displayed on the monitor as appropriate.

As described above, the usability of the conventional
apparatus capable of communicating the image and the
voice has been poor.

SUMMARY OF THE INVENTION 

The present invention is made in consideration of 
the above-described related background art, and an 
object thereof is to perform, in a communication system 
or apparatus which can communicate an image and a 
voice, image displaying and voice outputting which are 
convenient to an operator or a user. 

Concretely, the object of the present invention is
to provide a technique which can conveniently or easily
indicate to the operator the importance of the image
or the voice to be communicated.

In order to achieve the above object, according to
one preferred embodiment of the present invention,
there is provided a communication system comprising a
transmission apparatus for transmitting the image and
the voice to be added to the image, and a
reception apparatus for receiving the image and
the voice, wherein

the transmission apparatus comprises a
transmission means capable of selectively transmitting
the image and the voice to the reception apparatus, and

the reception apparatus comprises a control means
for controlling the image received from the
transmission apparatus and causing a predetermined
display means to display the controlled image, on the
basis of the voice transmitted by the transmission
apparatus.

Another object of the present invention is to
display a received image whose importance is
probably high, or in which the user is probably
interested, in a state in which the user can
conveniently or easily watch the displayed image.

In order to achieve the above object, according to
one preferred embodiment of the present invention,
there is provided a communication system comprising a
transmission apparatus for transmitting the image and
the voice to be added to the image, and a
reception apparatus for receiving the image and
the voice, wherein

the transmission apparatus comprises,

a data amount control means for controlling a
data amount of the image on the basis of a level of the
voice to be added to the image, and

a transmission means for transmitting the
image whose data amount was controlled by the data
amount control means, and

the reception apparatus comprises,

a reception means for receiving the image
transmitted by the transmission means, and

a display control means for causing a
predetermined display means to display the image
received by the reception means.

The above and other objects, features, and
advantages of the present invention will be apparent
from the following detailed description and the
appended claims in conjunction with the accompanying
drawings.


BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram showing an entire system 
used in embodiments of the present invention; 

Fig. 2 is a block diagram showing an entire system
on which the embodiments of the present invention are based;

Fig. 3 is a flow chart of a still image
transmission process;

Fig. 4 is a flow chart of a moving image
transmission process;

Fig. 5 is a flow chart of an information switch 
program; 

Fig. 6 is a flow chart showing execution procedure
of the information switch program in a first
embodiment;

Fig. 7 is a view showing an example of a display
used to designate a transmission site and select
information intended to be received and displayed;

Fig. 8 is a view showing an example of 
simultaneously displaying the moving image and the 
still image received from the plural transmission 
sites; 

Fig. 9 is a flow chart of the information switch
program in a second embodiment;

Fig. 10 is a flow chart of the information switch 
program in a third embodiment; 

Fig. 11 is a flow chart of the information switch
program in a fourth embodiment;

Fig. 12 is a flow chart of the information switch 
program in a fifth embodiment; 

Fig. 13 is a view showing an example of a camera
control window in the fifth embodiment;

Fig. 14 is a block diagram showing a system used
in a sixth embodiment;

Fig. 15 is a block diagram showing a basic system;

Fig. 16 is a flow chart of a still image delivery
process;

Fig. 17 is a flow chart of a moving image delivery 
process; 

Fig. 18 is a flow chart showing operation
procedure of an information switch program 380';

Fig. 19 is a flow chart showing operation 
procedure of an information switch program 381'; 

Fig. 20 is a view showing an example of an image
plane used to designate a transmission site 8' and
select received information in the embodiments;

Fig. 21 is a view showing an example of a display
image plane of a reception site 9' in the embodiments;

Fig. 22 is a flow chart showing operation
procedure of an information switch program 382';

Fig. 23 is a flow chart showing operation 
procedure of an information switch program 383'; 

Fig. 24 is a flow chart showing operation
procedure of an information switch program 384';

Fig. 25 is a block diagram showing a system used
in a seventh embodiment;

Fig. 26 is a block diagram showing a system used 
in an eighth embodiment; and 

Fig. 27 is a block diagram showing a system used
in a ninth embodiment.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

[First Embodiment]

Hereinafter, the first embodiment of the present
invention will be explained with reference to the
accompanying drawings.

Fig. 2 is a block diagram of an entire
communication system on which the present
embodiment is based.

In Fig. 2, reference numeral 1 denotes a computer
such as a personal computer, a workstation or the like
which has a CPU; 2 denotes a monitor such as a CRT
display or the like; 3 denotes a storage apparatus
which stores and holds a program and data; 4 denotes a
camera which photographs or takes an image; 5 denotes a
microphone which inputs a voice; 6 denotes a speaker
which outputs the voice; 7 denotes a network; 8 denotes
a transmission site; and 9 denotes a reception site.

In the storage apparatus 3 of the transmission
site 8, a still image information transmission program
310, a moving image information transmission program
320 and a voice information transmission program 330
are stored.

In the storage apparatus 3 of the reception site
9, a still image information display program 340, a
moving image information display program 350, a voice
information reproduction program 360 and an information
switch program 370 are stored.

The still image information transmission program
310 is the program which is used to capture or obtain a
still image from the camera 4 into the computer 1
through a video board, and transmit still image
information through the network 7.

The moving image information transmission program
320 is the program which is used to capture or obtain a
moving image from the camera 4 into the computer 1
through the video board, and transmit moving image
information through the network 7.

The voice information transmission program 330 is
the program which is used to capture or obtain the
voice from the microphone 5 into the computer 1 through
a sound board, and transmit voice information through
the network 7.

The still image information display program 340 is
the program which is used to receive the still image
information through the network 7, and display the
still image on the monitor 2 of the computer 1.

The moving image information display program 350
is the program which is used to receive the moving
image information through the network 7, and display
the moving image on the monitor 2 of the computer 1.

The voice information reproduction program 360 is the
program which is used to receive the voice information
through the network 7, reproduce the voice by the
computer 1, and output the reproduced voice from the
speaker 6.

These programs are read and initiated by the
computers 1 of the transmission site 8 and the
reception site 9, whereby the still image, the moving
image and the voice can be transmitted.

The still image is transmitted on the basis of,
e.g., a flow chart shown in Fig. 3.

When the reception site 9 wishes to receive and
display the still image from any of the plural
transmission sites, the site 9 designates one of the
transmission sites and notifies the designated site
(e.g., transmission site 8) of the designation (step
S301).

Then, when the designated transmission site (e.g.,
transmission site 8) detects that it has been
designated, the site 8 initiates the still image
information transmission program 310 to transmit the
still image (i.e., still image information) to the
reception site 9 (step S302).

Subsequently, the reception site 9 displays on its
monitor 2 the still image information received through
the network 7, by initiating the still image
information display program 340 (step S303).

Like the still image, the moving image is also
transmitted on the basis of a flow chart shown in Fig.
4.

When the reception site 9 wishes to receive and
display the moving image from any of the transmission
sites, the site 9 designates the transmission site and
notifies the designated site (e.g.,
transmission site 8) of the designation (step S401).

The transmission site, on detecting
that it has been designated, initiates
the moving image information transmission program 320
to transmit the moving image (i.e., moving image
information) to the reception site 9 (step S402).

Subsequently, the reception site 9 displays on its
monitor 2 the moving image information received through
the network 7, by initiating the moving image
information display program 350 (step S403).

The moving image can be displayed until the
reception site 9 detects that the moving image
terminates (step S404).

It should be noted that the voice can be 
transmitted by the same process as that of Fig. 4 for 
transmitting the moving image. 

Subsequently, the information switch program 370 
stored in the storage apparatus 3 of the reception site 
9 will be explained hereinafter. 

Fig. 5 shows the execution procedure of the
information switch program 370.

When the information switch program 370 is
initiated, the flow waits for an input event such as
key inputting, mouse clicking or the like (step S501).
Then, it is judged in a step S502 whether or not
the operator has made an input to designate the
transmission site. If yes, the transmission site is
designated based on that input (step S503).
Subsequently, the information required by the reception
site 9 is selected by the operator from among the still
image information, the moving image information and the
voice information, and the selection is transmitted to
the transmission site 8 (step S504).

Since the transmission site 8 transmits the
information according to the selection, this
information is thereafter received by the reception
site 9 (step S505).

When a termination event occurs (step S506), the
information switch program 370 terminates.
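
The flow of Fig. 5 can be pictured as a small event loop. The following Python sketch is illustrative only: the event tuples, the `SwitchState` class and the `sent` list (a stand-in for messages sent to the transmission site) are assumptions introduced here, not part of the disclosed program 370.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set, Tuple

KINDS = {"still", "moving", "voice"}  # information types named in the text


@dataclass
class SwitchState:
    """Hypothetical state of reception site 9 while the program runs."""
    site: Optional[str] = None                      # designated transmission site
    selected: Set[str] = field(default_factory=set)
    sent: List[Tuple[str, Tuple[str, ...]]] = field(default_factory=list)


def run_switch_program(events) -> SwitchState:
    """Process (kind, payload) input events in order (steps S501 to S506)."""
    state = SwitchState()
    for kind, payload in events:                    # S501: wait for an input event
        if kind == "designate":                     # S502/S503: designate the site
            state.site = payload
        elif kind == "select" and state.site:       # S504: select information
            state.selected = set(payload) & KINDS
            # Stand-in for notifying the transmission site of the selection.
            state.sent.append((state.site, tuple(sorted(state.selected))))
        elif kind == "terminate":                   # S506: termination event
            break
    return state
```

For example, feeding the events `("designate", "A")`, `("select", ["voice", "still"])` and `("terminate", None)` leaves site A designated with one selection message recorded; events after the termination event are ignored.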

Subsequently, the feature of the present invention 
will be explained in detail with reference to the 
accompanying drawings. 

The system of the present invention includes the
plural transmission sites 8, and the images can be
received in parallel from these transmission sites 8
and displayed on the monitor 2. Therefore, as shown in
Fig. 8, when the still images and/or the moving images
are actually received in parallel from the plural
transmission sites 8 (e.g., transmission sites A, B and
C), the received still images and/or moving images
are simultaneously displayed in three different windows
12.

Further, in addition to the image, the voice
added to the received image can also be received.
Therefore, in Fig. 8, the three different voices from
the three transmission sites A, B and C are
simultaneously outputted from the speaker 6.

Fig. 1 is a block diagram showing an entire
communication system which is obtained by replacing the
information switch program 370 in the system of Fig. 2
with an information switch program 371. Since
the system structure shown in Fig. 1 is substantially
the same as that shown in Fig. 2, only the execution
procedure of the replaced information switch program
371 will be explained in detail hereinafter.

Fig. 6 is a flow chart showing execution procedure 
of the information switch program 371. 

When the information switch program 371 is 
initiated, the flow waits for an input event such as 
key inputting, mouse clicking or the like (step S601). 

Then, when the input event occurs (step S602),
an image is displayed on the monitor 2, in the form shown
in Fig. 7, and the flow stands by until the operator
inputs the transmission site into a transmission site
designation portion 10 by using a keyboard of the
computer 1 (step S603).

When the transmission site is inputted in the step
S603, the flow stands by until any of still image,
moving image and voice buttons 11 of Fig. 7 is selected
and a determination button is depressed by the operator
(step S604).

The present invention is not limited to the above 
operation in which only one of the buttons 11 is 
selected. That is, two buttons may be selected. For 
example, when the voice and still image buttons or the 
voice and moving image buttons are selected, the 
reception site 9 can receive the still image with the 
voice or the moving image with the voice. 

When the information is selected in the step S604,
the selection is transmitted to the transmission site 8.
Then, whichever of the still image, the moving image and
the voice corresponds to the selection is
received from the transmission site 8, the
received still image or moving image is displayed
on the monitor 2, and the received voice is outputted
from the speaker 6 (step S605). For example, if the
images are received from the three transmission sites
8, the image plane shown in Fig. 8 is displayed
on the monitor 2.

Subsequently, when the still image or the moving
image is received and displayed, it is judged whether
or not the voice level of the voice added to the received
image has changed (step S606). It should be
noted that this voice is the voice added to the image
when the image is received, and is the voice captured
or obtained from the microphone 5 of the transmission
site 8.

When it is judged in the step S606 that the voice
level changed, the one of the three images in Fig. 8
whose voice level is highest (hereinafter referred to
as the highest voice-level image) is emphasized and
displayed (step S607).

In Fig. 8, the voice added to the image of the
transmission site B has the highest level, so the outer
frame of this image is emphasized by a thick line. Thus,
the highest voice-level image, i.e., the image of
interest which has probably changed the most, can be
displayed in a state in which it is easy to perceive
visually.
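
The selection in the steps S606 and S607 can be sketched as follows. The text does not define how the voice level is measured, so this minimal Python sketch assumes a root-mean-square (RMS) value over one block of audio samples; the function names and the "thick"/"thin" style labels are hypothetical.

```python
import math


def voice_level(samples):
    """Root-mean-square level of one block of audio samples (assumed metric)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def frame_styles(audio_by_site):
    """Return a frame style per site: 'thick' for the loudest, 'thin' otherwise."""
    levels = {site: voice_level(block) for site, block in audio_by_site.items()}
    loudest = max(levels, key=levels.get)           # highest voice-level image
    return {site: ("thick" if site == loudest else "thin") for site in levels}
```

With sample blocks for sites A, B and C in which B is loudest, only B receives the thick frame, as in Fig. 8. The sketch expects at least one site; an empty mapping raises `ValueError` from `max`.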

Further, the highest voice-level image may be
displayed in high resolution. By such displaying, only
the image of interest which has probably changed the
most can be displayed as a highly precise image. In
this case, since the images other than the highest
voice-level image can be displayed in low resolution,
the load of the image process can be reduced.
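
As a rough illustration of the load reduction, the images other than the highest voice-level image could be reduced in resolution before display. The sketch below assumes an image is a list of pixel rows and uses simple pixel skipping; the function names and the factor of 2 are assumptions, and the text does not specify a reduction method.

```python
def downsample(image, factor):
    """Keep every `factor`-th pixel in both directions (nearest-neighbour)."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    return [row[::factor] for row in image[::factor]]


def prepare_frames(frames, loudest, factor=2):
    """Leave the loudest site's frame untouched and downsample the others."""
    return {
        site: (img if site == loudest else downsample(img, factor))
        for site, img in frames.items()
    }
```

With a factor of 2, each low-resolution image carries about one quarter of the pixels of the original.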

Furthermore, only the highest voice-level image
may be displayed in color. By such displaying, only
the image of interest which has probably changed the
most can be displayed in good or satisfactory image
quality. In this case, since the images other than the
highest voice-level image can be displayed in
monochrome, the load of the image process can be
reduced. Moreover, even if only the highest voice-level
image is displayed as a 16-bit (gradation) color image
and the other images are displayed as 8-bit (gradation)
color images, the same effect as above can be derived.
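
The monochrome display of the other images can be sketched as a per-pixel conversion. The text does not specify a conversion formula; this sketch assumes 8-bit (R, G, B) pixels and the commonly used ITU-R BT.601 luma weights, and the function names are hypothetical.

```python
def to_monochrome(image):
    """Convert rows of 8-bit (R, G, B) pixels to single 8-bit luma values."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row]
        for row in image
    ]


def apply_color_policy(frames, loudest):
    """Keep color only for the loudest site's frame; others become monochrome."""
    return {
        site: (img if site == loudest else to_monochrome(img))
        for site, img in frames.items()
    }
```

Each monochrome pixel needs one value instead of three, which is one way the processing load for the less important images could fall.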

Furthermore, the highest voice-level image may be
enlarged and then displayed. By such displaying, only
the image of interest which has probably changed the
most can be displayed as a large image. In this case,
since the images other than the highest voice-level
image can be displayed as same-size or reduced images,
the load of the image process can be reduced.

Furthermore, when the highest voice-level image is
selected and projected from a projector or displayed on
a large-screen monitor (the projector or monitor being
provided independently), the image of interest which
has probably changed the most can be selectively
displayed in a large size.

Furthermore, only the highest-level voice may be
outputted from the speaker 6. By such outputting, only
the voice corresponding to the image of interest which
has probably changed the most can be made easy to hear.

Furthermore, control may be performed such
that the highest-level voice is outputted from the
speaker 6 at a higher volume than the other
voices. By such control, the same effect as
above can be derived.
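
Both voice output options (only the highest-level voice, or the highest-level voice at a higher volume than the others) can be expressed as one mixing step with a gain applied to the other voices. The following is a sketch under assumptions: voices are equal-rate lists of float samples, levels are already measured per site, and the name `mix_voices` and the default gain of 0.3 are hypothetical.

```python
def mix_voices(voices, levels, others_gain=0.3):
    """Mix sample lists, giving the highest-level voice full gain.

    others_gain=0.0 outputs only the highest-level voice; a value between
    0 and 1 keeps the other voices audible at a lower volume.
    """
    loudest = max(levels, key=levels.get)
    length = min(len(samples) for samples in voices.values())
    mixed = []
    for i in range(length):
        total = 0.0
        for site, samples in voices.items():
            gain = 1.0 if site == loudest else others_gain
            total += gain * samples[i]
        mixed.append(total)
    return mixed
```

Setting `others_gain` to 0.0 corresponds to outputting only the highest-level voice, while a value such as 0.5 corresponds to outputting it at a higher volume than the others.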

Finally, when a termination event occurs in a step
S606, the information switch program 371 terminates.

In the step S607, the above process is performed
on the image corresponding to the voice whose level
is highest (i.e., highest voice-level image). However,
the above process may be performed on all the images
corresponding to voices whose level is equal to
or higher than a predetermined value.

Further, the above process may be performed on the
image corresponding to the voice whose level has
changed greatly. By this process, an image which is
presumed to have changed greatly because its voice
changed greatly can be displayed in a state easy to be
perceived by the operator.

Furthermore, the above process may be performed on
all the images corresponding to voices whose
level is equal to or smaller than the predetermined
value. By this process, the image information from a
transmission site which is supposed to be not so
important can be displayed in a state easy to be
perceived by the operator.
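
The alternative selection rules described for the step S607 can be summarized in one helper. This is a sketch only: the mode names, the default threshold of 0.5 and the use of absolute difference as the measure of change are assumptions made for illustration.

```python
def select_sites(levels, mode="highest", threshold=0.5, previous=None):
    """Return the set of sites whose images receive the emphasis process."""
    if not levels:
        return set()
    if mode == "highest":                 # the single highest voice level
        return {max(levels, key=levels.get)}
    if mode == "at_or_above":             # level >= predetermined value
        return {s for s, v in levels.items() if v >= threshold}
    if mode == "at_or_below":             # level <= predetermined value
        return {s for s, v in levels.items() if v <= threshold}
    if mode == "largest_change":          # level changed the most
        previous = previous or {}
        return {max(levels, key=lambda s: abs(levels[s] - previous.get(s, 0.0)))}
    raise ValueError(f"unknown mode: {mode}")
```

For levels of 0.2, 0.9 and 0.6 at sites A, B and C, the "highest" rule picks B, the "at_or_above" rule with a threshold of 0.5 picks B and C, and the "at_or_below" rule picks A.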

[Second Embodiment]

Hereinafter, a modified embodiment of the first
embodiment will be concretely explained as the second
embodiment. In the present embodiment, a system
will be explained in which initial setting has been
performed to receive both the moving image and the
voice.

In the present embodiment, since only the execution
procedure of an information switch program is slightly
different from the execution procedure of the
information switch program 371 of Fig. 1, the concrete
explanation of the system itself is omitted. The
information switch program in the present embodiment is
hereinafter called an information switch program 372.
Fig. 9 is a flow chart of the information
switch program 372.

When the information switch program 372 is
initiated, the flow waits for the input event such as
key inputting, mouse clicking or the like (step S901).

When the input event occurs (step S902), the
transmission site is designated (step S903), and the
moving image information and the voice information are
received from the designated transmission site 8 and
displayed (step S904).

Then, when the voice level of the received and
displayed image has changed (step S905), if the voice
level is lower than a predetermined value (step S906),
control is performed such that the still image
is received from the transmission site 8. In
other words, a still image transmission instruction is
transmitted to the transmission site 8, and then the
still image transmitted according to this instruction
is received and displayed (step S907).

On the other hand, if the voice level is higher
than the predetermined value, control is
performed such that the moving image is received from
the transmission site 8. In other words, a moving
image transmission instruction is transmitted to the
transmission site 8, and then the moving image
transmitted according to this instruction is received
and displayed (step S908).

Therefore, the voice level of the voice
added to the displayed image is judged first. In
this case, although either the still image or the
moving image can be displayed, it is assumed that the
moving image is displayed as the initial image. Then,
if the voice level is low, it is judged that the image
is not so important, whereby the still image is
displayed. On the other hand, if the voice level is
high, it is judged that the image is important, whereby
the moving image is displayed. As a result, the
important image can be displayed in real time.

Finally, when the termination event occurs (step 
S909), the information switch program 372 terminates. 
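
The decision in the steps S905 to S908 can be sketched as below. Two points are assumptions made for the sketch and are not stated in the text: a level exactly equal to the predetermined value is treated as "moving", and an instruction is sent only when the wanted stream type differs from the current one.

```python
def wanted_stream(level, threshold):
    """'still' below the threshold, otherwise 'moving' (tie handling assumed)."""
    return "still" if level < threshold else "moving"


def switch_instructions(levels, threshold, initial="moving"):
    """Return (index, instruction) pairs sent to the transmission site."""
    current = initial                     # moving image is the initial setting
    sent = []
    for i, level in enumerate(levels):    # S905: a new voice level is observed
        wanted = wanted_stream(level, threshold)   # S906
        if wanted != current:
            sent.append((i, wanted))      # S907 or S908: send the instruction
            current = wanted
    return sent
```

For the level sequence 0.8, 0.2, 0.1, 0.9 and a threshold of 0.5, a still image instruction is sent at the second sample and a moving image instruction at the fourth.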

In the present embodiment, the initial setting has
been performed to receive both the moving image and the
voice. However, even if the initial setting is
performed to receive both the still image and the
voice, the same effect as above can be derived.

In the above-described embodiments, the
information switch programs 370 and 371 are stored at
the side of the reception site 9. However, the steps
(i.e., processes) of each program may be divided
between the transmission site 8 and the
reception site 9, into those to
be performed by the transmission site 8 and those to be
performed by the reception site 9.

According to the present embodiment, since the image
whose importance is supposed to be low is received and
displayed as the still image, data transfer efficiency
can be improved.

[Third Embodiment]

It should be noted that the basic system structure in
the present embodiment is the same as that shown in
Fig. 1. The feature of the present embodiment is that,
in case of receiving and displaying the still image
information, the still image at the time when the voice
level of the transmission site becomes high is newly
received from the transmission site.

Hereinafter, the present invention will be
explained with reference to the accompanying drawings.

The entire block diagram of the present embodiment
is illustrated in Fig. 1. Since only the execution
procedure of an information switch program is slightly
different from the execution procedure of the
information switch program 371 of Fig. 1, the concrete
explanation of the system structure itself is omitted.
The information switch program in the present
embodiment is hereinafter called an information switch
program 373 (not shown).

Fig. 10 is a flow chart of the information switch
program 373.

When the information switch program 373 is
initiated, the flow waits for the input event such as
key inputting, mouse clicking or the like (step S1001).

When the input event occurs (step S1002), the
transmission site is designated by the operator (step
S1003), necessary information from among the still
image information, the moving image information and the
voice information is selected by using the buttons
11 shown in Fig. 7, and a determination button is
depressed (step S1004).

Like the first and second embodiments, the still
image and/or the moving image and the voice (i.e.,
image and voice information) chosen in the above
selection are received from the transmission site 8 on
the basis of negotiation with the transmission
site 8. Then, the received image is displayed on the
monitor 2 and the received voice is outputted from the
speaker 6 (step S1005).

When the voice level corresponding to the received
and displayed image (e.g., still image in the present
embodiment) changes (step S1006), if there is an image
whose voice level has risen above a
predetermined value (step S1007), a new still image
is received again from the transmission site 8 from
which that image was received (step
S1008). In this case, of course, the transmission
site 8 again photographs or takes the new still image
by the camera 4 and then transmits the photographed
image to the reception site 9.

After the new still image was received and
displayed because the voice level of the previous still
image became higher than the predetermined level, when
the voice level again becomes lower than the
predetermined level in the step S1007, the displaying
of the new still image currently
performed on the monitor 2 may be stopped (step S1010).

Further, if the operator does not wish to cancel
the displaying of the still image once displayed, the
step S1010 may be skipped.

On the other hand, when the voice level 
corresponding to the received and displayed image 
(i.e., still image in the present embodiment) does not 
change in the step S1006, the still image displayed on 
the monitor 2 of the reception site 9 does not change. 

When the termination event occurs (step S1009),
the information switch program 373 terminates.
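
The refresh rule of the steps S1006 to S1008 can be sketched as follows. It is a minimal illustration that assumes the voice level is sampled at regular intervals and that a "change" means a difference from the previous sample; the function name is hypothetical.

```python
def still_refresh_times(levels, threshold):
    """Return the sample indices at which a new still image is requested."""
    requests = []
    previous = None
    for t, level in enumerate(levels):
        changed = previous is not None and level != previous   # S1006
        if changed and level > threshold:                      # S1007
            requests.append(t)                                 # S1008
        previous = level
    return requests
```

For the level sequence 0.1, 0.1, 0.7, 0.7, 0.9, 0.2 with a threshold of 0.5, new still images are requested at indices 2 and 4 only, so a quiet site generates no image traffic.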

In the present embodiment, the information switch
program 373 is stored at the side of the reception site
9. However, the program 373 may be stored at the side
of the transmission site 8 and controlled by the
transmission site 8. Further, when the program
373 stored in the transmission site 8 is read and
initiated by the reception site 9, this program can be
controlled from the reception site 9.

According to the present embodiment, as to the
still image whose voice level is high, i.e., the
still image whose importance is supposed to be
high, the new still image is frequently received.
Therefore, as the importance of the image becomes
higher, the image can be received and displayed
closer to real time. Conversely, since the still image
whose voice level is low is
hardly updated, the transmission data amount on the
network 7 can be reduced.

[Fourth Embodiment]

In the fourth embodiment, at the time when the
voice level of the voice corresponding to an image
(i.e., still image or moving image) which is not yet
transmitted becomes high, reception of the
still image or the moving image corresponding to
that voice from the transmission site is started. That
is, only the voice is received from the transmission
site in an initial state.

Like the second and third embodiments, the present
embodiment also uses the system structured based on the
entire block diagram illustrated in Fig. 1. However,
only the information switch program 371 in Fig. 1 is
replaced by an information switch program 374 (not
shown).

Fig. 11 is a flow chart concerning execution
procedure of the information switch program 374 which
will be explained in detail hereinafter.

When the information switch program 374 is
initiated, the flow waits for the input event such as
key inputting, mouse clicking or the like (step S1101).

When the input event occurs (step S1102), the
transmission site is designated by the operator (step
S1103), necessary information from among the still
image information, the moving image information and the
voice information is selected by using the buttons
11 shown in Fig. 7, and the determination button is
depressed (step S1104). It is assumed in the present
embodiment that the still image and the voice were
selected.

When the above selection is determined in the step
S1104, a transmission instruction of the still image
and the voice is sent from the reception site 9 to the
transmission site 8, and the transmission site 8
prepares the transmission of the still image and the
voice in accordance with the instruction.
However, only the voice is first received by the
reception site 9 and outputted from the speaker 6 (step
S1105).

Then, when the voice level of the outputted voice
changes (step S1106), if the changed voice level is
higher than a predetermined value (step S1107),
negotiation with the transmission site 8 is performed
such that the still image corresponding to the
outputted voice is received from the transmission
site 8, and then the still image is actually received
(step S1108).

On the other hand, if the voice level is not
higher than the predetermined value, the still image is
not received until the voice level becomes higher
than the predetermined value, and only the voice
continues to be received.

Finally, when the termination event occurs (step
S1109), the information switch program 374 terminates.

In the present embodiment, it was explained that
the still image and the voice are transmitted.
However, the moving image and the voice can be
transmitted in the same manner.

By the above operation, since the image 
information concerning the important image of which 
voice level became high is selectively received, 
effective data receiving can be performed. 
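
The reception control of the steps S1105 to S1108 can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the information switch program 374 itself: the function name `receive_plan`, the numeric voice levels and the threshold value are hypothetical and do not appear in the embodiment.

```python
def receive_plan(voice_levels, threshold):
    """Decide, per voice-level sample, what the reception site receives.

    Hypothetical sketch: the voice is always received, and the still image
    is additionally received only while the voice level is higher than the
    predetermined value (threshold).
    """
    plan = []
    for level in voice_levels:
        if level > threshold:
            plan.append("voice+still")  # negotiate and receive the still image
        else:
            plan.append("voice")        # only the voice continues to be received
    return plan
```

For example, `receive_plan([1, 5, 3], 3)` yields a plan in which the still image is received only for the second sample, because a level equal to the predetermined value is not higher than it.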
[Fifth Embodiment] 

In the fifth embodiment, image information
controlling is performed on the transmission site
whose voice level became high.

In the present embodiment, only the information
switch program 371 in Fig. 1 is replaced by an
information switch program 375 (not shown). Further,
in order to change a photographing range of the camera
4 of the transmission site 8, panning, zooming and the
like can be controlled from the reception site 9
through the still image information transmission
program 310 or the moving image information
transmission program 320. These points are
different from the first embodiment.

Fig. 12 is a flow chart concerning the execution
procedure of the information switch program 375, which
will be explained in detail hereinafter.

When the information switch program 375 is 
initiated, the flow waits for the input event such as 
key inputting, mouse clicking or the like (step S1201). 

When the input event occurs (step S1202), the
transmission site is designated by the operator (step
S1203), necessary information is selected from among
the still image information, the moving image
information and the voice information by using the
buttons 11 shown in Fig. 7, and the determination
button is depressed (step S1204).

Then, by performing negotiation with the
transmission site 8 on the basis of the above
selection, the still image or the moving image is
received and displayed on the monitor 2, and the voice
is received and outputted from the speaker 6 (step
S1205).

If the voice levels of the voices corresponding to
the images currently displayed on the monitor 2 change
(step S1206), the image (transmission site) whose
voice level is highest is determined from among the
displayed images. Then, the target image to be camera
controlled is switched or changed to the image of the
determined transmission site through a camera
control window shown in Fig. 13, thereby enabling the
camera controlling (step S1207).

Concretely, the camera controlling is to control a 
pan angle, a tilt angle and zooming magnification of 
the camera 4 from the reception site 9. 

In the present embodiment, only the voice 
corresponding to the camera-controllable image is 
outputted from the speaker 6. Thus, a condition of the 
transmission site 8 to be camera controlled can be 
easily known or grasped from the reception site 9. 
Further, when only the voice level of the image 
corresponding to the transmission site to be camera 
controlled is made higher than those of the images 
corresponding to the other transmission sites, the same 
effect can be substantially derived. 

Subsequently, the camera controlling is performed
on the image of the transmission site which was made
controllable in the step S1207, by the operator with
use of the camera control window 13 (step S1208).
Thus, the photographing range and the zooming
magnification of the image transmitted from the
transmission site are changed (step S1209).

Finally, when the termination event occurs (step 
S1210), the information switch program 375 terminates. 

Fig. 13 shows an example of a camera control
interface. However, the present embodiment is not
limited to this example. That is, if the camera can be
controlled by numerical values, the pan angle, the tilt
angle and the zooming magnification may be inputted as
such numerical values.

As a modification of the present embodiment, in 
the step S1207, the camera controlling may be performed 
on the image which is received from the transmission 
site of which voice level is lowest or the transmission 
site of which voice level most changes. 
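
The determination of the camera control target in the step S1207, including the modification just described, can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `select_camera_target`, the policy names and the dictionary representation of the voice levels are hypothetical and are introduced only for illustration.

```python
def select_camera_target(levels, previous_levels=None, policy="highest"):
    """Pick the transmission site whose image becomes camera controllable.

    levels / previous_levels: hypothetical dicts mapping a site name to its
    current / preceding voice level. policy is one of "highest", "lowest"
    or "most_changed".
    """
    if not levels:
        return None
    if policy == "highest":
        return max(levels, key=levels.get)
    if policy == "lowest":
        return min(levels, key=levels.get)
    if policy == "most_changed":
        previous_levels = previous_levels or {}
        # A site with no preceding level is treated as unchanged.
        return max(
            levels,
            key=lambda site: abs(levels[site] - previous_levels.get(site, levels[site])),
        )
    raise ValueError("unknown policy: " + policy)
```

With the levels `{"A": 3, "B": 7, "C": 5}` and the preceding levels `{"A": 3, "B": 6, "C": 1}`, the three policies select the sites B, A and C respectively.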

According to the present embodiment, it is possible
to control the photographing state of the image which
is supposed to be important because its voice level is
high or changes greatly.

Further, when the image which was received from
the transmission site whose voice level is lowest is
camera controlled, if its photographing range or the
like is not appropriate, the range or the like can
be appropriately changed.

In the above-described embodiments, if the result is
yes in each of the steps S606, S906, S1006, S1106 and
S1206, which judge whether or not the voice level
changed, the flow advances to the next step S607, S907,
S1007, S1107 or S1207, respectively. However, the
present invention is not limited to this procedure.
That is, the flows may advance to the steps S607, S907,
S1007, S1107 and S1207 every time a predetermined time
elapses. By such procedure, the effects derived in the
above-described embodiments can also be derived
irrespective of whether or not the voice level changes.

For example, if the image is supposed to be
important because its voice level is always high
although the level does not change, the appropriate
judging can be performed by applying this structure.
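
The two ways of advancing the flow (on a change of the voice level, or every time a predetermined time elapses) can be sketched as follows. This is only an assumed illustration: the function name `should_evaluate` and its parameters are hypothetical, and time is expressed in arbitrary units.

```python
def should_evaluate(previous_level, current_level, last_time, now, interval=None):
    """Return True when the flow should advance to the judging step.

    If interval is given, the flow advances every time that predetermined
    time has elapsed, irrespective of whether the voice level changed.
    Otherwise the flow advances only when the voice level changed.
    """
    if interval is not None:
        return now - last_time >= interval
    return current_level != previous_level
```

With a constant voice level, `should_evaluate(5, 5, 0.0, 1.0)` is false, whereas `should_evaluate(5, 5, 0.0, 1.0, interval=1.0)` is true, which corresponds to the case of a voice level that is always high but does not change.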

Although the present invention is directed to the
apparatus which has the above-described structure or
the system which is composed of a plurality of such
apparatuses, it is obviously understood that a method
which performs the above-described processes and a
storage medium which stores, in a computer readable
state, a program to realize such a method are also
included in the scope of the present invention.

In the above-described embodiments, when the
controlling (i.e., image emphasizing, moving image
reception controlling, still image reception
controlling or the like) according to the voice level
of each image is performed by the reception site (i.e.,
not the transmission site), the various controlling can
be easily performed in a case of an internet in which
the respective transmission sites are distant from one
another.

In the above-described embodiments, the various
controlling was performed according to the voice level
(i.e., volume) of the voice. However, the present
invention is not limited to such operation. That
is, a case where the various controlling is
performed according to contents of the voice is also
included in the scope of the present invention.

For example, the controlling explained in the 
above-described embodiments may be performed according 
to a frequency (i.e., high frequency or low frequency), 
the contents recognized by voice recognition, or the 
like. 

According to the above-explained embodiments of
the present invention, since the various controlling is
performed on the basis of the voice added to the
image, when the image and the voice can be communicated
between the transmission and reception sites, an
easily usable communication system or image processing
method can be provided.
[Sixth Embodiment] 

Fig. 15 is a block diagram of a communication
system on which the sixth to ninth embodiments are based.

In Fig. 15, reference numeral 1' denotes a
computer such as a personal computer (PC), a work
station or the like which includes a CPU; 2' denotes a
monitor such as a CRT display or the like; 3' denotes a
storage apparatus which stores and holds a program and
data; 4' denotes a camera which inputs image
information; 5' denotes a microphone which inputs voice
information; 6' denotes a speaker which outputs a
voice; 7' denotes a network; 8' denotes a transmission
site; and 9' denotes a reception site.

In the storage apparatus 3' of the transmission
site 8', a still image information delivery program
310', a moving image information delivery program 320'
and a voice information delivery program 330' are
stored.

In the storage apparatus 3' of the reception site 
9', a still image information display program 350', a 
moving image information display program 360', a voice 
information reproduction program 370' and an 
information switch program 380' are stored. 

The still image information delivery program 310'
is the program which is used to capture or obtain a
still image from the camera 4' into the computer 1'
through a video board or the like, and transmit still
image information through the network 7'.

The moving image information delivery program 320'
is the program which is used to capture or obtain a
moving image from the camera 4' into the computer 1'
through the video board or the like, and transmit
moving image information through the network 7'.

The voice information delivery program 330' is the
program which is used to capture or obtain the voice
from the microphone 5' into the computer 1' through a
sound board or the like, and transmit the voice
information through the network 7'.

The still image information display program 350'
is the program which is used to receive the still image
information through the network 7', and display the
still image on the monitor 2' of the computer 1'.

The moving image information display program 360'
is the program which is used to receive the moving
image information through the network 7', and display
the moving image on the monitor 2' of the computer 1'.

The voice information reproduction program 370' is
the program which is used to receive the voice
information through the network 7', reproduce the voice
by the computer 1', and output the reproduced voice
from the speaker 6'.

By utilizing these programs through the network
7', still image delivery, moving image delivery and
voice delivery can be performed.

The still image delivery is realized by the 
procedure shown in a flow chart of Fig. 16. 

If it is intended to display on the reception site
9' the still image transmitted from any one of the
transmission sites 8', the inputting is performed to
designate the transmission site (step S301').

Then, the still image information delivery program
310' of the designated transmission site 8' is
initiated (step S302').

The still image information received from the
transmission site 8' through the network 7' is
displayed on the monitor 2' of the computer 1' on the
basis of the still image information display program
350' (step S303').

The moving image delivery is realized by the
procedure shown in a flow chart of Fig. 17.

If it is intended to display on the reception site
9' the moving image transmitted from any one of the
transmission sites 8', the inputting is performed to
designate the transmission site (step S401').

Then, the moving image information delivery
program 320' of the designated transmission site 8' is
initiated (step S402').

The moving image information received from the
transmission site 8' through the network 7' is
displayed on the monitor 2' of the computer 1' on the
basis of the moving image information display program
360' (step S403').

The moving image displaying continues until
an instruction to terminate the displaying is inputted
from the reception site 9' (step S404').

Further, the voice delivery can also be realized
by the same procedure as that for the moving image
delivery shown in Fig. 17.

In the step corresponding to S401', if it is intended
to reproduce at the reception site 9' the voice
transmitted from any one of the transmission sites 8',
the inputting is performed to designate the
transmission site.

Then, in the step corresponding to S402', a voice
encode program and the voice information delivery
program 330' of the designated transmission site 8' are
initiated.

In the step corresponding to S403', the voice
information received from the transmission site 8'
through the network 7' is reproduced and outputted from
the speaker 6' on the basis of the voice information
reproduction program 370'.

In the step corresponding to S404', the voice
reproducing continues until an instruction to terminate
the reproducing is inputted from the reception site 9'.

Further, an image and voice delivery system can be 
realized on the basis of the information switch program 
380' of the reception site 9'. 

Fig. 18 is a flow chart of the information switch
program 380'.

In Fig. 18, when the information switch program
380' is initiated, the reception site 9' waits for an
input event by the operator such as key inputting,
mouse clicking or the like (step S501').

Then, when the input event to designate the
transmission site 8' occurs (step S502'), the
transmission site is designated, e.g., in such a form
as indicated by a numeral 10' in Fig. 20 (step S503'),
and the information which the operator intends to
receive is selected from among the still image
information or the moving image information and the
voice information by selecting (i.e., clicking) buttons
11' in Fig. 20 (step S504').

In this case, it should be noted that the voice
can be received together with the still image or the
moving image. Therefore, any one of the still image,
the moving image, the voice, the still image and the
voice, and the moving image and the voice can be
selected.
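
The five selectable combinations in the step S504' can be checked with a small sketch such as the following. The function name `is_valid_selection` and the boolean flags are hypothetical and are introduced only for illustration; the embodiment itself describes the selection through the buttons 11' and does not specify such a function.

```python
def is_valid_selection(still=False, moving=False, voice=False):
    """Return True for the combinations listed for the step S504'.

    Assumed rule: the still image and the moving image are alternatives,
    the voice may accompany either or be selected alone, and at least one
    kind of information must be selected.
    """
    if still and moving:
        return False
    return still or moving or voice
```

Enumerating all eight flag combinations with this rule leaves exactly the five selections named above.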

Then, the still image delivery, the moving image 
delivery or the voice delivery corresponding to the 
selected information is performed (step S505'). 

By repeating the procedure in the steps S502' to
S505', it is possible that the still image and the
moving image (i.e., images 12' to 14') received from
the plural transmission sites 8' are displayed as shown
in Fig. 21, and the voices corresponding to these
images are outputted.

Fig. 21 is a view showing a case where the still 
image and moving image are received from the three 
transmission sites 8' and the voices received from the 
three transmission sites 8' are mixed and outputted. 

Subsequently, in the case where "the still image
and the voice" or "the moving image and the voice" is
selected in the selection procedure of the step S504',
when a reproduced voice level of the voice appendant to
each image (images 12' to 14') changes (step S506'),
the displayed image (any one of the images 12' to 14')
which is transmitted from the transmission site 8' and
to which the highest level (volume) voice is appendant
is emphasized and displayed (step S507'). In Fig. 21,
since the level of the voice appended to the still
image / moving image 12' is highest, a frame of the
displayed image 12' is emphasized. Thus, the image
which is most interesting can be emphasized and
displayed. Such controlling is performed on the basis
of the still image information display program 350' or
the moving image information display program 360'.

Further, when only the voice is selected in the
selection procedure of the step S504', the voice of the
transmission site whose voice level is highest is
reproduced in the step S506' at a level further raised
above its actual level, so that this voice is clearly
indicated.

For example, in Fig. 21, in a case where the three
pairs of "the still image and the voice"
(corresponding to the displayed images 12' to 14') are
received from the three transmission sites and
outputted, and only the voice is received from one
other transmission site and outputted (i.e., four
voices are mixed and outputted), when the voice level
from the transmission site transmitting only the voice
is highest, this voice is further enlarged, reproduced
and outputted, whereby the most interesting voice can
be emphasized and outputted.
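
The voice emphasis in this example can be sketched as follows. This is an illustrative sketch only: the function name `mix_with_emphasis`, the gain representation and the boost factor of 2.0 are assumptions, since the embodiment does not state by how much the voice level is enlarged.

```python
def mix_with_emphasis(levels, boost=2.0):
    """Return a playback gain per transmission site for the mixed output.

    levels: hypothetical dict mapping a site name to its voice level.
    The site whose voice level is highest is reproduced with an enlarged
    gain (boost); every other voice keeps its actual level (gain 1.0).
    """
    if not levels:
        return {}
    loudest = max(levels, key=levels.get)
    return {site: (boost if site == loudest else 1.0) for site in levels}
```

With four mixed voices, three accompanying the images 12' to 14' and one from a voice-only site whose level is highest, only the voice-only site receives the enlarged gain.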

Finally, when the termination event occurs (step
S508'), the information switch program 380' terminates.

It should be noted in the example of Fig. 21 that
the frame representing the still image or the moving
image is made remarkable. However, any other method
for clearly indicating one of the plural images may be
used.

Subsequently, the feature of the present
embodiment to which the basic structure or form shown
in Fig. 15 is applied will be explained hereinafter
with reference to Fig. 14.

In Fig. 14, the information switch program 380' of
Fig. 15 is replaced by an information switch program
381', and a delivery data amount control program 340'
is added to the storage apparatus 3' of the
transmission site 8'.

The delivery data amount control program 340' is
the program which is used to control the data amount
transmitted from the transmission site 8'.

Controlling of the data amount is realized by
changing the compression ratio of the still image
delivered based on the still image information delivery
program 310', changing the compression ratio and the
frame rate of the moving image delivered based on the
moving image information delivery program 320', or the
like.

Fig. 19 is a flow chart of the information switch
program 381'.

In Fig. 19, when the information switch program
381' is initiated, the reception site 9' waits for an
input event by the operator such as key inputting,
mouse clicking or the like (step S601').

Then, when the input event to designate the
transmission site 8' occurs in the reception site 9'
(step S602'), an address of the transmission site 8' is
designated and inputted in the same manner as in the
step S503' of Fig. 18 (step S603'), and the information
which the operator intends to receive is selected (step
S604'). As described above, any one of the still
image, the moving image, the voice, the still image and
the voice, and the moving image and the voice can be
selected.

Then, like the process in Fig. 18, the still image
delivery, the moving image delivery or the voice
delivery corresponding to the selected information is
performed from the transmission site 8' (step S605').

Further, as explained in Fig. 18, by repeating the
procedure in the steps S602' to S605', the plural still
images and the moving images are displayed on the
monitor (CRT) 2' as shown in Fig. 21, and their
corresponding voices are also outputted.

Then, in the case where "the still image and the
voice" or "the moving image and the voice" is received
and displayed as at least one of the images 12' to 14',
when the level of the reproduced voice changes (step
S606'), the image of the transmission site whose
voice level is highest among the outputted voices is
emphasized and displayed, like the image 12' of Fig.
21 (step S607').

On the other hand, in a step S608', controlling is
performed to increase the delivery data amount of the
displayed image (any one of the images 12' to 14')
corresponding to the transmission site which transmits
the highest-level voice, and decrease the delivery data
amounts of the other displayed images.

By such controlling, the reception site 9'
emphasizes the displayed image according to the voice
level in the steps S606' and S607', and also outputs an
instruction signal to control the delivery data amount
to each transmission site 8'. Therefore, in Fig. 21,
the instruction signal to increase the delivery data
amount is outputted to the transmission site 8'
corresponding to the image 12'. On the other hand, the
instruction signal to decrease the delivery data amount
is outputted to the transmission sites 8' corresponding
to the images 13' and 14'.

In response to the instruction signal, each
transmission site 8' controls the delivery data amount
by actually controlling the compression ratio and the
frame rate (in case of the moving image) of the image,
and transmits the still image and/or the moving image.

In the present embodiment, it should be noted that
the still image is compressed in a JPEG (joint
photographic expert group) compression system and the
moving image is compressed in an MPEG (motion picture
expert group) compression system. However, the
compression system is not limited to them, and other
systems may be used.

When the termination instruction event occurs
(step S609'), the information switch program 381'
terminates.

In the above-described delivery data amount
controlling, e.g., if an effective data band has been
already determined due to limitations of the network,
controlling can be performed such that the total
delivery data amount falls within that band.
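
The delivery data amount controlling under a fixed band can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `allocate_delivery_amounts`, the fixed share given to the emphasized image and the equal split among the other images are hypothetical choices, since the embodiment only requires that the emphasized image receives more data, the others less, and the total stays within the band.

```python
def allocate_delivery_amounts(levels, total_band, emphasized_share=0.5):
    """Split a total delivery data amount among the transmission sites.

    levels: hypothetical dict mapping a site name to its voice level.
    The site with the highest voice level receives emphasized_share of
    total_band; the remainder is divided equally among the other sites,
    so the sum never exceeds total_band.
    """
    if not levels:
        return {}
    loudest = max(levels, key=levels.get)
    others = [site for site in levels if site != loudest]
    if not others:
        return {loudest: total_band}
    rest = total_band * (1.0 - emphasized_share) / len(others)
    allocation = {site: rest for site in others}
    allocation[loudest] = total_band * emphasized_share
    return allocation
```

The resulting amounts would correspond to the instruction signals sent from the reception site 9' to each transmission site 8', which then adjusts its compression ratio and frame rate accordingly.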

By the above-explained delivery data amount
controlling, as in the basic structure and operation
explained in Figs. 15 and 18, the interesting displayed
image whose voice level is high can be emphasized,
and moreover this interesting image can be
displayed in higher image quality (or at a higher frame
rate) than the other displayed images. Thus,
a more convenient, easy-to-use image can be
displayed.

In the above embodiments, the displayed image to
which the highest voice-level (volume) voice is
appendant is emphasized. However, the present
invention is not limited to such embodiments. That
is, a case where the image corresponding to the
voice whose level change (volume change) is largest
is emphasized and displayed, and a case where the images
corresponding to the several voices whose level
changes are large are emphasized and displayed, are
also included in the scope of the present invention.

In the present embodiment shown in Figs. 14 and
19, it is also possible that the voice (i.e., the voice
appendant to the emphasized and displayed image) which
is supposed to be important because its voice level is
highest or its level change is large is reproduced
and outputted from the speaker 6' after its voice level
is made larger than the actual level. In this case,
the user can clearly recognize the relation between the
emphasized and displayed image and the voice appendant
thereto. Further, even if only the voice appendant to
the emphasized and displayed image is outputted, the
same effect can be derived.

Furthermore, since the image of the transmission
site whose importance is supposed to be low because its
voice level change is small is displayed with a small
data amount, the communication data amount can be
reduced, whereby data communication can be smoothly
performed.

In the above-described embodiments, the delivery
data amount of the displayed image to which the
highest-level voice or the large level-change voice is
appendant was increased by simply decreasing the
compression ratio or increasing the frame rate.
However, if the maximum delivery data amount from each
transmission site is limited because of the
communication system, the displayed image which is
supposed to be important may be controlled such that
its compression ratio is increased and its frame rate
is also increased (if the frame rate is more
important), or its compression ratio is decreased and
its frame rate is also decreased (if the image quality
is more important).
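
The trade-off between the compression ratio and the frame rate under a limited maximum delivery data amount can be illustrated with a rough calculation. This sketch assumes that the compression ratio means the uncompressed frame size divided by the compressed frame size, and that the delivery data amount per second is simply the compressed frame size multiplied by the frame rate; the function names and the numerical values are hypothetical.

```python
def delivery_rate(frame_bytes, compression_ratio, frame_rate):
    """Delivery data amount per second (bytes/s) under the assumed model."""
    return frame_bytes / compression_ratio * frame_rate


def compression_ratio_for(frame_bytes, frame_rate, max_rate):
    """Compression ratio needed to keep the given frame rate within max_rate."""
    return frame_bytes * frame_rate / max_rate
```

For instance, with 300000-byte uncompressed frames and a cap of 150000 bytes per second, raising the frame rate to 30 frames per second requires a compression ratio of 60 under this model, whereas lowering the frame rate to 5 frames per second allows a compression ratio of only 10, i.e., higher image quality.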
[Seventh Embodiment] 

In the sixth embodiment, the delivery data amount
of the displayed image was controlled according to the
voice level, and further the displayed image was
processed (i.e., emphasized and displayed) by the
predetermined display means. However, the present
invention is not limited to such an embodiment.

In the seventh embodiment, by providing plural 
threshold values in voice level for switching the 
delivery data amount, the image is processed and 
displayed. 

Hereinafter, the present embodiment will be
explained with reference to Figs. 22 and 25.

It should be noted that the system structure shown
in Fig. 25 which is used in the present embodiment is
basically the same as that shown in Fig. 14. That is,
only the information switch program 381' in Fig. 14 is
replaced by an information switch program 382' in Fig.
25.

For this reason, only the information switch
program 382' will be explained in detail hereinafter.

Fig. 22 is a flow chart showing the operation
procedure of the information switch program 382'.

In Fig. 22, initial steps S901' to S905' are
substantially the same as the steps S601' to S605' in
the sixth embodiment. That is, in these steps, the
still image information, the moving image information
and the voice information are delivered. By repeating
the steps S902' to S905', the plural still images
and/or the moving images shown in Fig. 21 are displayed
on the CRT 2' of the reception site 9', and also the
voices corresponding thereto are outputted.

Subsequently, when the reproduced voice level (any
one of the voices corresponding to the images 12' to
14' in Fig. 21) changes (step S906'), it is judged
whether or not its corresponding image is the
emphasized and displayed image (step S907').

In a case where the image judged in the step S907'
is not the emphasized and displayed image, i.e., the
image 13' or 14' in Fig. 21, if the voice level is
not equal to or larger than a predetermined value α
(step S908'), the flow returns to the event-loop step
S901' as it is. On the other hand, if the voice
level is equal to or larger than the predetermined
value α (step S908'), the judged image is also
emphasized and displayed (step S909'). That is, it is
possible in the present embodiment that the plural
images are emphasized and displayed. For example, if
the flow advances from the state shown in Fig. 21 to
the step S909', the image 13' or 14' is emphasized and
displayed in addition to the image 12'.

Further, in a case where the image judged in the
step S907' is the image which has been already
emphasized and displayed, if the voice level appendant
thereto is not equal to or smaller than a predetermined
value β (step S910'), the emphasized displaying of the
image is continued (step S909'). On the other hand, if
the voice level is equal to or smaller than the
predetermined value β (step S910'), the emphasizing and
displaying of the image is stopped, and the displaying
state returns to the ordinary state (step S911').

By such processes, all of the displayed images
whose appendant voice levels are within a
predetermined level are emphasized and displayed.
Therefore, all the images which are supposed to be
relatively important can be emphasized and displayed.

Further, after the processes of the steps S906' to
S911' are performed, the delivery data amount is
controlled. That is, the controlling is performed to
increase the delivery data amount of the image
emphasized and displayed and decrease the delivery data
amount of the image ordinarily displayed (step S912').

Such controlling is performed by outputting an
instruction signal for controlling the delivery data
amount from the reception site 9' to each transmission
site 8'.

Then, when the termination instruction event
occurs (step S913'), the information switch program
382' terminates.

It is assumed that the voice level α is larger
than the voice level β. When the magnitudes of these
levels α and β are appropriately set, it is possible to
prevent the inconvenience that the image once
emphasized and displayed soon returns to the ordinarily
displayed image and thus the desired image becomes
difficult to recognize or find.
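
The two-threshold judgment of the steps S907' to S911' behaves as a hysteresis, which can be sketched as follows. This is an illustrative sketch only: the function name `update_emphasis` and the parameter names `upper` and `lower` (standing for the larger and the smaller of the two predetermined values) are hypothetical.

```python
def update_emphasis(emphasized, level, upper, lower):
    """Return the new emphasis state of one displayed image.

    emphasized: whether the image is currently emphasized and displayed.
    An ordinary image becomes emphasized when level >= upper (step S908');
    an emphasized image returns to the ordinary state only when
    level <= lower (step S910'). upper must be larger than lower.
    """
    if upper <= lower:
        raise ValueError("upper must be larger than lower")
    if not emphasized:
        return level >= upper
    return level > lower
```

Because the release threshold is lower than the emphasis threshold, a voice level that fluctuates between the two values does not make the emphasis flicker on and off.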

According to the present embodiment, the image
which is relatively important or interesting can be
emphasized and displayed, and further this
interesting image can be displayed in higher image
quality (or at a higher frame rate) than the other
images. Thus, a more convenient, easy-to-use image
can be displayed.

Further, since the image which was emphasized and
displayed because its voice level was once increased
can be continuously emphasized and displayed to some
extent, the user can easily recognize the
emphasized and displayed image.

In the present embodiment, it is also possible
that the voice (i.e., the voice appendant to the
emphasized and displayed image) which is supposed to be
important because its voice level is highest or its
level change is large is reproduced and outputted from
the speaker 6' after its voice level is made larger
than the actual level. In this case, the user can
clearly recognize the relation between the emphasized
and displayed image and the voice appendant thereto.
Further, even if only the voice appendant to the
emphasized and displayed image is outputted, the same
effect can be derived.
[Eighth Embodiment] 

In the sixth and seventh embodiments, the delivery
data amount of the displayed image was controlled
according to the voice level. However, the present
invention is not limited to such embodiments.

In the eighth embodiment, the delivery data amount
is controlled based on the image control information of
the transmission site 8'.

Hereinafter, the present embodiment will be
explained with reference to Figs. 23 and 26.

It should be noted that the system structure shown
in Fig. 26 which is used in the present embodiment is
basically the same as that shown in Fig. 14. That is,
only the information switch program 381' in Fig. 14 is
replaced by an information switch program 383' in Fig.
26.

Therefore, only the information switch program
383' will be explained in detail hereinafter.

Fig. 23 is a flow chart showing the operation
procedure of the information switch program 383'.

Like the sixth and seventh embodiments, the still
image information delivery, the moving image
information delivery and the voice information delivery
are initially performed (steps S1001', S1002', S1003',
S1004', S1005'). By repeating the steps S1002' to
S1005', the plural still images and the moving images
shown in Fig. 21 are displayed on the CRT 2' of the
reception site 9', and the voices corresponding thereto
are also outputted.

Subsequently, e.g., when the pan angle, the tilt angle
and the zooming magnification at any of the transmission
sites from which the displayed images 12' to 14'
(the images 12' to 14' are assumed to be moving images)
are transmitted are moved or changed and thus the
photographing condition of the displayed image changes
(step S1006'), the displayed image corresponding to
that transmission site is emphasized and displayed,
like the image 12' in Fig. 21 (step S1007').

In this case, the reception site 9' judges the
change of the photographing condition by receiving a
control signal to control the photographing condition
of each transmission site 8'.
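
The judgment of the step S1006' can be sketched as a comparison of the photographing condition reported for each transmission site before and after a control signal is received. This is only an assumed illustration: the `CameraState` type, its fields and the function name `changed_sites` are hypothetical, and the embodiment does not specify the format of the control signal.

```python
from typing import NamedTuple


class CameraState(NamedTuple):
    """Hypothetical photographing condition of one transmission site."""
    pan: float
    tilt: float
    zoom: float


def changed_sites(previous, current):
    """Return the sites whose photographing condition changed.

    previous / current: dicts mapping a site name to its CameraState.
    A site that has no previous state is not reported as changed.
    """
    return [
        site
        for site, state in current.items()
        if site in previous and previous[site] != state
    ]
```

The images corresponding to the returned sites would then be the ones emphasized and displayed in the step S1007'.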

Subsequently, controlling is performed to increase
the delivery data amount of the displayed image
corresponding to that transmission site and
decrease the delivery data amounts of the other
displayed images (step S1008'). Such controlling
is performed by outputting the instruction signal to
control the delivery data amount from the reception
site 9' to each transmission site 8'.

When the termination instruction event occurs
(step S1009'), the image information switch program
383' terminates.

In the above-described delivery data amount
controlling, e.g., if an effective data band has
already been determined due to limitations of the
network, controlling can be performed such that the
total delivery data amount falls within such band.
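
As an illustration only, the band-limited controlling described above can be sketched as a proportional scaling of the requested delivery data amounts. The function name fit_to_band, the site names and the numeric values are hypothetical and do not appear in the embodiments; proportional scaling is merely one possible policy.

```python
def fit_to_band(requested, band):
    """Scale per-site delivery data amounts so that their total does not
    exceed the effective data band (same unit for all values, e.g. kbit/s).

    Proportional scaling is an assumption made for this sketch.
    """
    total = sum(requested.values())
    if total <= band:
        # Already within the band: leave the requested amounts unchanged.
        return dict(requested)
    scale = band / total
    return {site: amount * scale for site, amount in requested.items()}


# Hypothetical example: the emphasized site asks for more than the others.
requested = {"site_a": 600.0, "site_b": 200.0, "site_c": 200.0}
allocated = fit_to_band(requested, band=500.0)
```

Because every amount is scaled by the same factor, the emphasized image keeps its larger share of the band relative to the other displayed images.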

In the present embodiment, the change in
photographing condition is judged in the step S1006' on
the basis of the control information received from each
transmission site. However, the present invention is
not limited to such operation. That is, it can be
judged that the photographing condition changes, when a
changing amount of the contents of the displayed image
is large.
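
One conceivable way of judging that the changing amount of the image contents is large is sketched below, assuming grayscale frames given as lists of pixel rows. The mean absolute difference measure, the function names and the threshold value are assumptions made for illustration and are not specified in the embodiment.

```python
def content_change(prev_frame, cur_frame):
    """Mean absolute pixel difference between two equally sized frames."""
    total = 0
    count = 0
    for prev_row, cur_row in zip(prev_frame, cur_frame):
        for p, c in zip(prev_row, cur_row):
            total += abs(p - c)
            count += 1
    return total / count if count else 0.0


def condition_changed(prev_frame, cur_frame, threshold=10.0):
    """Judge a change in photographing condition when the changing amount
    of the image contents reaches the (hypothetical) threshold."""
    return content_change(prev_frame, cur_frame) >= threshold
```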

According to the present embodiment, the image
which is relatively important or interesting because
its photographing condition changes can be emphasized
and displayed. Further, such interesting image can
be displayed in higher image quality (or at a higher
frame rate) than the other displayed images. Thus,
a more convenient, easy-to-use image can be displayed.

In the present embodiment, it is also possible
that the voice (i.e., the voice appendant to the
emphasized and displayed image) which is supposed to be
important because its voice level is highest or its
level change is large is reproduced and outputted from
the speaker 6' after its voice level is made larger than
the actual level. In this case, the user can clearly
recognize the relation between the emphasized and
displayed image and the voice appendant thereto.
Further, even if only the voice appendant to the
emphasized and displayed image is outputted, the same
effect can be derived.

Furthermore, as to the image of the transmission site
of which importance is supposed to be low because its
voice level is low or its voice level change is small,
since such image is displayed in the small data
amount, the communication data amount can be reduced,
whereby data communication can be smoothly performed.

In the above-described embodiments, the delivery
data amount of the displayed image to which the
highest-level voice or the large level-change voice is
appendant was increased by simply decreasing the
compression ratio or increasing the frame rate.
However, if the maximum delivery data amount from each
transmission site is limited because of the
communication system, the display image which is
supposed to be important may be controlled such that
its compression ratio is increased and its frame rate
is also increased (if frame rate is more important), or
its compression ratio is decreased and its frame rate
is also decreased (if image quality is more important).
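
The trade-off described above can be sketched under a simple assumed model in which the delivery data amount per second equals frame rate x uncompressed frame size / compression ratio. The model, the function name settings_within_limit and the example frame rates are assumptions for illustration and are not taken from the embodiments.

```python
def settings_within_limit(raw_frame_kbit, max_kbit_per_s, prefer,
                          high_fps=30.0, low_fps=5.0):
    """Choose a frame rate and a compression ratio so that
    fps * raw_frame_kbit / ratio stays within max_kbit_per_s.

    prefer == "frame_rate": higher frame rate, higher compression ratio.
    prefer == "quality":    lower frame rate, lower compression ratio.
    """
    if prefer == "frame_rate":
        fps = high_fps
    elif prefer == "quality":
        fps = low_fps
    else:
        raise ValueError("prefer must be 'frame_rate' or 'quality'")
    # Smallest compression ratio that keeps the data amount within the limit.
    ratio = max(fps * raw_frame_kbit / max_kbit_per_s, 1.0)
    return fps, ratio


fps_fast, ratio_fast = settings_within_limit(1000.0, 1500.0, "frame_rate")
fps_fine, ratio_fine = settings_within_limit(1000.0, 1500.0, "quality")
```

With the same limit, giving priority to the frame rate yields a larger compression ratio than giving priority to the image quality, which corresponds to the two cases named in the text.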
[Ninth Embodiment] 

In the sixth to eighth embodiments, the displayed
image was emphasized or the delivery data amount (image
quality, frame rate) was controlled in accordance with
the voice level or the image photographing condition.
However, the present invention is not limited to such
embodiments.

That is, in the eighth embodiment, by obtaining 
some information concerning the image information at 
the transmission site, the delivery data amount is 
changed, and also the displayed image is emphasized. 

Hereinafter, the present embodiment will be
explained with reference to Figs. 24 and 27.

It should be noted that the system structure shown
in Fig. 27 used in the present embodiment is basically
the same as that shown in Fig. 14. That is, the
information switch program 381' in Fig. 14 is replaced
by an information switch program 384' and a heat sensor
4" is newly added in Fig. 27.

Therefore, only the information switch program 
384' will be explained in detail hereinafter. 

Fig. 24 is a flow chart showing operation 
procedure of the information switch program 384'. 

Like the sixth to eighth embodiments, the still
image information delivery, the moving image
information delivery and the voice information delivery
are performed (steps S1101', S1102', S1103', S1104',
S1105'). By repeating the steps S1102' to S1105', the
plural still images and the moving images shown in Fig.
21 are displayed on the CRT 2' of the reception site
9', and the voices corresponding thereto are also
outputted.

In the present embodiment, as shown in Fig. 27,
the heat sensor 4" is appended to the camera 4' of each
transmission site 8'. Thus, since a temperature (air
temperature, water temperature or the like) at the
photographing spot is detected by the heat sensor 4",
temperature information can be transmitted through a
control line connecting the camera 4' to the computer
1' every time a temperature change occurs.

In the flow chart of Fig. 24, when the temperature
information is inputted from the heat sensor 4" of any
one of the transmission sites and such temperature
information is equal to or higher than a predetermined
temperature (step S1106'), the displayed image which is
received from the transmission site corresponding to
such heat sensor 4" is emphasized like the image
12' in Fig. 21 (step S1107').

Subsequently, controlling is performed to increase
the delivery data amount of such displayed image to
be emphasized and decrease the delivery data amounts of
the other image information (step S1108'). Such
controlling is performed by outputting an instruction
signal for controlling the delivery data amount from
the reception site 9' to each transmission site 8'.

When the termination instruction event occurs in
the reception site 9' (step S1109'), the information
switch program 384' terminates.

In the present embodiment, when the temperature
information represents the temperature equal to or
higher than the predetermined temperature in the step
S1106', the image corresponding to such temperature
information is emphasized and displayed. However, the
present invention is not limited to such operation.
That is, a case where the image is emphasized and
displayed when the temperature information represents
the temperature equal to or lower than the
predetermined temperature is also included in the scope
of the present invention.

Further, in the step S1106', the displaying
according to the temperature represented by the
temperature information may be performed. That is, the
scope of the present invention includes a case where
the displayed image corresponding to the
temperature information representing the temperature
equal to or lower than m degrees is emphasized and
displayed by using a blue frame, the displayed image
corresponding to the temperature information
representing the temperature higher than m degrees and
lower than n degrees is emphasized and displayed by
using a yellow frame, and the displayed image
corresponding to the temperature information
representing the temperature equal to or higher than n
degrees is emphasized and displayed by using a red
frame. In this case, the delivery data amount is
controlled to satisfy, e.g., "delivery data amount of
blue-frame displayed image" < "delivery data amount of
yellow-frame displayed image" < "delivery data amount
of red-frame displayed image".
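
The frame color selection described above can be sketched as follows, assuming m < n. The function name frame_color and the relative weights in DATA_WEIGHT are hypothetical; only the ordering blue < yellow < red of the delivery data amounts comes from the text.

```python
# Hypothetical relative delivery data amounts; only their ordering
# (blue < yellow < red) is taken from the description.
DATA_WEIGHT = {"blue": 1, "yellow": 2, "red": 3}


def frame_color(temperature, m, n):
    """Return the emphasis frame color for a temperature, assuming m < n."""
    if temperature <= m:
        return "blue"      # equal to or lower than m degrees
    if temperature < n:
        return "yellow"    # higher than m and lower than n degrees
    return "red"           # equal to or higher than n degrees


def delivery_weight(temperature, m, n):
    """Relative delivery data amount for the image at this temperature."""
    return DATA_WEIGHT[frame_color(temperature, m, n)]
```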

In the above-described delivery data amount
controlling, e.g., if an effective data band has
already been determined due to limitations of the
network, controlling can be performed such that the
total delivery data amount falls within such band.

In the present embodiment, it is also possible
that the voice (i.e., the voice appendant to the
emphasized and displayed image) which is supposed to be
important because its voice level is highest or its
level change is large is reproduced and outputted from
the speaker 6' after its voice level is made larger than
the actual level. In this case, the user can clearly
recognize the relation between the emphasized and
displayed image and the voice appendant thereto.
Further, even if only the voice appendant to the
emphasized and displayed image is outputted, the same
effect can be derived.

Further, when the frequency of reception of the
temperature information from the heat sensor 4"
corresponding to one displayed image is low, it is
judged that the change in temperature is small and the
change in displayed image is also small. Thus, the
controlling may be performed to decrease the delivery
data amount of such displayed image. In this case,
the control information for decreasing the delivery
data amount is transmitted from the reception site 9'
to such transmission site.
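
A minimal sketch of the above judgment, assuming that the reception site records the time at which each piece of temperature information arrives. The function name sites_to_reduce, the time window and the minimum report count are hypothetical parameters and are not specified in the embodiment.

```python
def sites_to_reduce(report_times, now, window, min_reports):
    """Return the sites whose temperature information was received fewer
    than min_reports times during the last `window` seconds.

    report_times maps a site name to a list of reception timestamps
    (seconds); such sites are candidates for a decreased delivery data
    amount.
    """
    quiet = []
    for site, times in report_times.items():
        recent = [t for t in times if now - window <= t <= now]
        if len(recent) < min_reports:
            quiet.append(site)
    return quiet


# Hypothetical example: site_b has reported nothing in the last 30 seconds.
reports = {"site_a": [1, 2, 3, 58, 59], "site_b": [5]}
quiet_sites = sites_to_reduce(reports, now=60, window=30, min_reports=2)
```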

In the above-described embodiments, the delivery
data amount was simply changed according to the
temperature information. However, if the maximum
delivery data amount from each transmission site is
limited because of the communication system, the
compression ratio and the frame rate may be adaptively
changed according to the temperature information.

As described above, according to the present
embodiment, how to emphasize the displayed image
corresponding to the temperature information obtained
from the heat sensor 4", the image quality and the
frame rate can be adaptively determined according to
the temperature represented by such temperature
information.
[Modified Embodiments] 

The present invention can be applied as a part of
a system which is composed of plural pieces of
equipment, or can also be applied as a part of an
apparatus comprising one piece of equipment.

The present invention is not limited to the
apparatuses and the methods for realizing the
above-described embodiments. That is, the scope of the
present invention also includes a case where program
codes of software to realize the above-described
embodiments are supplied to a computer (CPU or MPU) in
the above system or the apparatus such that the system
or the apparatus makes the various devices operative in
order to realize the above-described embodiments in
accordance with the supplied program codes.

In this case, the program codes themselves of the
software realize the functions of the above-described
embodiments. Thus, the program codes themselves and a
means, e.g., a storage medium to store the program
codes, for supplying the program codes to the computer
are included in the scope of the present invention.

As such storage medium to store the program
codes, e.g., a floppy disk, a hard disk, an optical
disk, a magneto-optical disk, a CD-ROM, a magnetic
tape, a non-volatile memory card, a ROM or the like can
be used.

Also, in addition to the case where the functions 
of the above-described embodiments are realized when 
the computer controls the various devices in accordance 
with only the supplied program codes, it is also 
included in the scope of the present invention the 
program codes in a case where the above-described 
embodiments are realized in cooperation with the OS 
(operating system) by which the program codes operate 
in the computer, another application software, or
the like. 

Further, it is included in the scope of the 
present invention a case where the supplied program 
codes are stored into a memory provided for a function 
expansion board of the computer or a function expansion 
unit connected to the computer and, after that, a CPU 
or the like provided for the function expansion board 
or the function expansion unit executes a part or all 
of the actual processes on the basis of instructions of 
the program codes, and the above-described embodiments
are realized by such processes.

According to the above-described embodiments, in
the communication system or apparatus which can
communicate the image and the voice, it is possible to
provide the communication system or the image process
system which is convenient for use by the user.

Concretely, the reception image which is supposed
to be important or interesting can be displayed in a
state in which it can be watched by the user as easily
as possible, by utilizing the voice which is appendant to
the reception image, the photographing conditions
(e.g., pan angle, tilt angle, zooming magnification),
and the like.

Further, the reception image which is supposed to
be important or interesting can be displayed such that
the image can be distinguished from the other reception
images, by emphasizing its frame or the like.

The present invention can be variously modified 
and varied within the spirit and scope of the appended 
claims. 



WHAT IS CLAIMED IS:

1. A communication system comprising a
transmission apparatus for transmitting an image and a
voice to be added to the image, and a reception
apparatus for receiving the image and the voice,
wherein

said transmission apparatus comprises transmission
means capable of selectively transmitting the image and
the voice to said reception apparatus; and

said reception apparatus comprises control means
for controlling the image received from said
transmission apparatus and causing predetermined
display means to display the controlled image, on the
basis of the voice transmitted by said transmission
apparatus.



2. A system according to Claim 1, wherein said
one reception apparatus is connected to said plural
transmission apparatuses to be able to selectively
receive the image or the voice.

3. A system according to Claim 2, wherein said 
control means causes said predetermined display means 
to display each of the images transmitted from said 
plural transmission apparatuses.



4. A system according to Claim 1, wherein said
reception apparatus comprises said predetermined
display means.

5. A system according to Claim 1, wherein said
control means emphasizes the image transmitted from 
said transmission apparatus, in accordance with 
contents of the voice transmitted from said 
transmission apparatus. 

6. A system according to Claim 5, wherein the 
emphasizing is to enlarge the image. 

7. A system according to Claim 5, wherein the 
emphasizing is to emphasize an outer frame of the
image.

8. A system according to Claim 1, wherein said 
reception apparatus comprises a speaker for outputting 
the voice. 

9. A system according to Claim 1, wherein said 
control means controls a voice level of the voice 
transmitted from the predetermined transmission 
apparatus, in accordance with contents of the voices 
transmitted from said plural transmission apparatuses. 

10. A system according to Claim 1, wherein said
control means controls resolution of the image
transmitted from said transmission apparatus, in 
accordance with contents of the voice transmitted by 
said transmission apparatus. 

11. A communication system comprising a
transmission apparatus for transmitting an image and a
voice to be added to the image, and a reception
apparatus for receiving the image and the voice,
wherein

said transmission apparatus comprises transmission
means capable of selectively transmitting the image and
the voice to said reception apparatus, and

said reception apparatus comprises,

control means for controlling the image
receiving from said transmission apparatus on the basis
of the voice transmitted from said transmission
apparatus, and

display control means for receiving the image
transmitted from said transmission apparatus and
causing predetermined display means to display the
received image.

12. A system according to Claim 11, wherein said 
control means performs, in accordance with contents of 
the voice transmitted from said transmission apparatus, 
the controlling such that the different kinds of images
are received from said transmission apparatus from
which the voice was received. 



13. A system according to Claim 12, wherein the
different kinds of images include a still image and a
moving image.



14. A system according to Claim 11, wherein said
control means selects whether or not the image is to be
received from said transmission apparatus, in
accordance with contents of the voice transmitted from
said transmission apparatus.

15. A system according to Claim 11, wherein said
one reception apparatus is connected to said plural
transmission apparatuses and can selectively receive
the image or the voice.



16. A system according to Claim 15, wherein said
control means controls a voice level of the voice
transmitted from predetermined one of said plural
transmission apparatuses, in accordance with contents
of the voices transmitted from said plural transmission
apparatuses.

17. A system according to Claim 16, wherein said
predetermined transmission apparatus is one of said
plural transmission apparatuses transmitting the
voices, and is the transmission apparatus which
transmitted the voice most satisfying a predetermined
condition.

18. A system according to Claim 15, wherein said
control means controls voice levels of the voices
transmitted from said transmission apparatuses other
than a predetermined transmission apparatus, in
accordance with contents of the voices transmitted from
said plural transmission apparatuses.



19. A system according to Claim 11, wherein said
reception apparatus further comprises a speaker for
outputting the voice.



20. A system according to Claim 11, wherein said
control means controls resolution of the image
transmitted from said transmission apparatus, in
accordance with contents of the voice transmitted from
said transmission apparatus.



21. A communication system comprising a
transmission apparatus for transmitting an image and a
voice to be added to the image, and a reception
apparatus for receiving the image and the voice,
wherein

said transmission apparatus comprises,

transmission means capable of selectively
transmitting the image and the voice to said reception
apparatus, and

image pickup equipment control means for
controlling a predetermined image pickup equipment to
capture the image, and

said reception apparatus comprises allocation
means for allocating a control right to control an
operation of said predetermined image pickup equipment,
on the basis of the voice transmitted from said
transmission apparatus.

22. A system according to Claim 21, wherein

said one reception apparatus is connected to said
plural transmission apparatuses and can selectively
receive the image or the voice, and

said allocation means allocates the control right
such that the operation of said image pickup equipment
corresponding to predetermined one of said plural
transmission apparatuses can be controlled, in
accordance with contents of the voices transmitted from
said plural transmission apparatuses.

23. A system according to Claim 21, wherein said
transmission apparatus comprises said image pickup
equipment.

24. An information process apparatus which can
receive an image and a voice to be added to the image,
from a transmission apparatus, said information process
apparatus comprising:

reception means capable of receiving the image and
the voice to be added to the image; and

control means for controlling the image received
by said reception means and displaying the controlled
image on predetermined display means, on the basis of
the voice received by said reception means.



25. An apparatus according to Claim 24, wherein
said information process apparatus is connected to said
plural transmission apparatuses and can selectively
receive the image or the voice.

26. An apparatus according to Claim 25, wherein
said control means can cause said predetermined display
means to display each of the images transmitted from
said plural transmission apparatuses.

27. An apparatus according to Claim 24, wherein
said information process apparatus comprises said
predetermined display means.

28. An apparatus according to Claim 24, wherein
said control means emphasizes the image transmitted
from said transmission apparatus, in accordance with
contents of the voice transmitted from said
transmission apparatus.

29. An apparatus according to Claim 28, wherein
the emphasizing is to enlarge the image.

30. An apparatus according to Claim 28, wherein
the emphasizing is to emphasize an outer frame of the
image.

31. An apparatus according to Claim 24, further
comprising a speaker for outputting the voice.

32. An apparatus according to Claim 25, wherein
said control means controls a voice level of the voice
transmitted from predetermined one of said plural
transmission apparatuses, in accordance with contents
of the voices transmitted from said plural transmission
apparatuses.

33. An apparatus according to Claim 24, wherein
said control means controls resolution of the image
transmitted from said transmission apparatus, in
accordance with contents of the voice transmitted from
said transmission apparatus.

34. An information process apparatus which can
receive an image and a voice to be added to the image,
from a transmission apparatus, said information process
apparatus comprising:

reception means capable of receiving the image and
the voice to be added to the image;

control means for controlling the image receiving
of said reception means, on the basis of the voice
received by said reception means; and

display control means for causing predetermined
display means to display the image received by said
reception means.

35. An apparatus according to Claim 34, wherein
said control means performs the controlling such that
the different kinds of the images are received from
said transmission apparatus from which the voice was
received, in accordance with contents of the voice
transmitted from said transmission apparatus.

36. An apparatus according to Claim 35, wherein 
the different kinds of the images include a still image 
and a moving image. 

37. An apparatus according to Claim 34, wherein
said control means selects whether the image is to be
received from said transmission apparatus, in
accordance with contents of the voice transmitted from
said transmission apparatus.

38. An apparatus according to Claim 34, wherein
said information process apparatus is connected to said
plural transmission apparatuses and can selectively
receive the image or the voice.



39. An apparatus according to Claim 38, wherein
said control means controls a voice level of the voice
transmitted from predetermined one of said plural
transmission apparatuses, in accordance with contents
of the voices transmitted from said plural transmission
apparatuses.

40. An apparatus according to Claim 39, wherein
said predetermined transmission apparatus is one of
said plural transmission apparatuses transmitting the
voices, and is the transmission apparatus which
transmitted the voice most satisfying a predetermined
condition.



41. An apparatus according to Claim 38, wherein
said control means controls voice levels of the voices
transmitted from said transmission apparatuses other
than a predetermined transmission apparatus, in
accordance with contents of the voices transmitted from
said plural transmission apparatuses.

42. An apparatus according to Claim 34, wherein 
said information process apparatus further comprises a 
speaker for outputting the voice. 

43. An apparatus according to Claim 34, wherein 
said control means controls resolution of the image 
transmitted from said transmission apparatus, in 
accordance with contents of the voice transmitted from 
said transmission apparatus. 

44. An information process apparatus which can 
communicate with a transmission apparatus having a 
predetermined image pickup equipment for capturing an 
image, said information process apparatus comprising: 

reception means capable of receiving the image and 
a voice to be added to the image; and 

allocation means for allocating a control right to 
control operation of said image pickup equipment, on 
the basis of the voice received by said reception
means.

45. An apparatus according to Claim 44, wherein 
said information process apparatus is connected to 

said plural transmission apparatuses and can
selectively receive the image or the voice, and

said allocation means allocates the control right
such that the operation of said image pickup equipment 
corresponding to predetermined one of said plural 
transmission apparatuses can be controlled, in 
accordance with contents of the voices transmitted from 
said plural transmission apparatuses. 

46. An information process method which can 
receive an image and a voice to be added to the image, 
from a transmission apparatus, said method comprising: 

a reception step of receiving the image and the 
voice to be added to the image; and 

a control step of controlling the image received 
in said reception step and causing a predetermined 
display means to display the controlled image, on the 
basis of the voice received in said reception step. 

47. An information process method which can 
receive an image and a voice to be added to the image, 
from a transmission apparatus, said method comprising: 

a reception step of receiving the image and the 
voice to be added to the image; 

a control step of controlling the image receiving 
in said reception step, on the basis of the voice 
received in said reception step; and 

a display control step of causing a predetermined
display means to display the image received in said
reception step.

48. An information process method which can 
communicate with a transmission apparatus having a 
predetermined image pickup equipment for capturing an 
image, said method comprising: 

a reception step of receiving the image and a 
voice to be added to the image; and 

an allocation step of allocating a control right 
to control operation of the image pickup equipment, on 
the basis of the voice received in said reception step. 

49. A storage medium which stores, in a computer 
readable state, a program supplied to an apparatus 
which can receive an image and a voice to be added to 
the image, from a transmission apparatus, said program
comprising:

a reception step of receiving the image and the 
voice to be added to the image; and 

a control step of controlling the image received 
in said reception step and causing a predetermined 
display means to display the controlled image, on the 
basis of the voice received in said reception step. 

50. A storage medium which stores, in a computer
readable state, a program supplied to an apparatus
which can receive an image and a voice to be added to
the image, from a transmission apparatus, said program
comprising:

a reception step of receiving the image and the
voice to be added to the image;

a control step of controlling the image receiving
in said reception step, on the basis of the voice
received in said reception step; and

a display control step of causing a predetermined
display means to display the image received in said
reception step.



51. A storage medium which stores, in a computer
readable state, a program supplied to an apparatus
which can communicate with a transmission apparatus
having a predetermined image pickup equipment for
capturing an image, said program comprising:

a reception step of receiving the image and a
voice to be added to the image; and

an allocation step of allocating a control right
to control operation of the image pickup equipment, on
the basis of the voice received in said reception step.



52. A system according to Claim 1, wherein the
controlling by said control means is performed on the
basis of a voice level of the voice.



53. A communication system comprising a
transmission apparatus for transmitting an image and a
voice to be added to the image, and a reception
apparatus for receiving the image and the voice,
wherein

said transmission apparatus comprises,

data amount control means for controlling a
data amount of the image on the basis of a level of the
voice to be added to the image, and

transmission means for transmitting the image
of which data amount was controlled by said data amount
control means, and

said reception apparatus comprises,

reception means for receiving the image
transmitted by said transmission means, and

display control means for causing
predetermined display means to display the image
received by said reception means.

54. A system according to Claim 53, wherein said
reception apparatus further comprises image control
means for controlling the image received by said
reception means, in accordance with the level of the
voice.

55. A system according to Claim 53, wherein said
data amount control means controls the data amount of
the image on the basis of plural threshold values.






56. A system according to Claim 54, wherein said
image control means controls the image received by said
reception means, on the basis of plural threshold
values.

57. A system according to claim 54, wherein the 
controlling by said image control means is to emphasize 
the image. 

58. A system according to Claim 57, wherein the
emphasizing of the image is to emphasize an outer frame
of the image.

59. A system according to Claim 53, wherein said
communication system comprises the plural transmission
apparatuses, said reception means receives the plural
images transmitted from said transmission means of said
plural transmission apparatuses, and said display
control means causes said predetermined display means
to simultaneously display the plural images.

60. An information process apparatus which
connects a reception apparatus receiving an image and a
voice and transmits the image and the voice to be added
to the image, comprising:

data amount control means for controlling a data
amount of the image, on the basis of a level of the
voice to be added to the image; and

transmission means for transmitting the image
controlled by said data amount control means.



61. An information process method which transmits
an image and a voice to be added to the image, to a
reception apparatus, comprising:

a data amount control step of controlling a data
amount of the image, on the basis of a level of the
voice to be added to the image; and

a transmission step of transmitting the image
controlled in said data amount control step.



62. A storage medium which stores, in a computer
readable state, an information process program which is
used to transmit an image and a voice to be added to
the image, to a reception apparatus, said program
comprising:

a data amount control step of controlling a data
amount of the image, on the basis of a level of the
voice to be added to the image; and

a transmission step of transmitting the image
controlled in said data amount control step.






63. An information process apparatus which is
connected to a transmission apparatus capable of
transmitting an image and a voice to be added to the
image, and controlling a data amount at a time when the
image is transmitted, said information process
apparatus comprising:

reception means for receiving the image and the
voice transmitted by said transmission apparatus; and

output means for outputting instruction
information for controlling the data amount of the
image transmitted by said transmission apparatus, on
the basis of a level of the voice received by said
reception means.



64. An apparatus according to Claim 63, further 
comprising display control means for displaying the 
image received by said reception means, on a monitor. 


65. An apparatus according to Claim 63, further 
comprising a monitor for displaying the image received 
by said reception means. 



66. An apparatus according to Claim 64, wherein

said information process apparatus is connected to
plural transmission apparatuses including said
transmission apparatus,

said reception means receives the plural images
and the voices transmitted from said plural
transmission apparatuses, and

said display control means simultaneously displays
the plural images and the voices received by said
reception means, on said monitor.

67. A method for controlling a reception
apparatus which is connected to a transmission
apparatus capable of transmitting an image and a voice
to be added to the image, and of controlling a data
amount at a time when the image is transmitted, said
method comprising:

a reception step of receiving the image and the
voice transmitted by the transmission apparatus; and

an output step of outputting instruction
information for controlling the data amount of the
image transmitted by the transmission apparatus, on the
basis of a level of the voice received in said
reception step.
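
The reception-side method above could be sketched along the following lines. The sketch assumes 16-bit signed PCM voice samples, measures the voice level as a normalized RMS value, and encodes the instruction information as a small JSON message. The message fields, the single threshold and the function names are hypothetical. The claims do not prescribe any particular level measure or message format.

```python
import json
import math


def voice_level(samples, full_scale=32768):
    """Normalized RMS level (0.0 to 1.0) of 16-bit signed PCM samples."""
    if not samples:
        return 0.0
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return min(1.0, rms / full_scale)


def build_instruction(samples, threshold=0.25):
    """Build instruction information asking the transmission apparatus
    to raise or lower the image data amount (hypothetical format)."""
    level = voice_level(samples)
    action = "increase" if level >= threshold else "decrease"
    return json.dumps(
        {
            "type": "data_amount_instruction",
            "action": action,
            "level": round(level, 3),
        }
    )
```

The returned string would be sent back to the transmission apparatus over whatever channel connects the two apparatuses. That channel is outside the scope of this sketch.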

68. A storage medium which stores, in a computer
readable state, a control program for controlling a
reception apparatus which is connected to a
transmission apparatus capable of transmitting an image
and a voice to be added to the image, and of
controlling a data amount at a time when the image is
transmitted, said program comprising:

a reception step of receiving the image and the
voice transmitted by the transmission apparatus; and

an output step of outputting instruction
information for controlling the data amount of the
image transmitted by the transmission apparatus, on the
basis of a level of the voice received in said
reception step.


69. A communication system which comprises a
transmission apparatus for transmitting an image
photographed by predetermined image pickup means, and a
reception apparatus for receiving the transmitted
image, wherein

said transmission apparatus comprises transmission
means for transmitting the image, and

said reception apparatus comprises,

reception means for receiving the image
transmitted by said transmission means,

image control means for controlling the image
received by said reception means, in accordance with an
environment in which the image is photographed, and

display control means for causing
predetermined display means to display the image
controlled by said image control means.

70. A system according to Claim 69, wherein the
environment in which the image is photographed is
either one of a pan angle, a tilt angle and zooming
magnification of said image pickup means in case of
photographing.

71. A system according to Claim 69, wherein the
environment in which the image is photographed is a
temperature.

72. A system according to Claim 71, wherein said
transmission apparatus comprises a sensor for detecting
the temperature.

73. A communication system which comprises a
transmission apparatus for transmitting an image
photographed by predetermined image pickup means, and a
reception apparatus for receiving the transmitted
image, wherein

said transmission apparatus comprises,

data amount control means for controlling a
data amount of the image in accordance with an
environment in which the image is photographed, and

transmission means for transmitting the image
controlled by said data amount control means, and

said reception apparatus comprises,

reception means for receiving the image
transmitted by said transmission means, and

display control means for causing
predetermined display means to display the image
received by said reception means.

74. A system according to Claim 73, wherein the
environment in which the image is photographed is
either one of a pan angle, a tilt angle and zooming
magnification of said image pickup means in case of
photographing.

75. A system according to Claim 73, wherein the 
environment in which the image is photographed is a 
temperature.






76. A system according to Claim 75, wherein said
transmission apparatus comprises a sensor for detecting
the temperature.

ABSTRACT OF THE DISCLOSURE

An object of the present invention is to provide,
in a communication system or apparatus which can
communicate an image and a voice, image display and
voice output which are convenient for an operator or
a user and, concretely, to provide a technique which can
conveniently and easily indicate to the operator the
importance of the image and the voice to be
communicated.

In order to achieve the object, the present invention
provides a communication system comprising
a transmission apparatus for transmitting an image and
a voice to be added to the image, and a reception
apparatus for receiving the image and the voice,
wherein the transmission apparatus comprises a
transmission means capable of selectively transmitting
the image and the voice to the reception apparatus, and
the reception apparatus comprises a control means for
controlling the image received from the transmission
apparatus and causing a predetermined display means to
display the controlled image, on the basis of the voice
transmitted by the transmission apparatus.

COMBINED DECLARATION AND POWER OF ATTORNEY FOR
ORIGINAL, DESIGN, NATIONAL STAGE OF PCT, SUPPLEMENTAL,
DIVISIONAL, CONTINUATION OR CONTINUATION-IN-PART APPLICATION



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name.

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and
joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a patent is
sought on the invention entitled:

COMMUNICATION SYSTEM, INFORMATION PROCESSING APPARATUS AND METHOD, AND STORAGE MEDIUM

the specification of which

a. [X] is attached hereto

b. [ ] was filed on ________ as application No. ________

and was amended on ________ (if applicable).

PCT FILED APPLICATION ENTERING NATIONAL STAGE

c. [ ] was described and claimed in International Application No. ________ filed on ________ and

as amended on ________ (if any).

I hereby state that I have reviewed and understand the contents of the above-identified specification, including
the claims, as amended by any amendment referred to above.

I acknowledge the duty to disclose information which is material to the examination of this application in
accordance with Title 37, Code of Federal Regulations, § 1.56(a).

[X] I hereby claim foreign priority benefits under Title 35, United States Code § 119 of any foreign
application(s) for patent or inventor's certificate listed below and have also identified below any foreign
application for patent or inventor's certificate having a filing date before that of the application on which
priority is claimed:

[ ] The attached 35 U.S.C. § 119 claim for priority for the U.S. application(s) listed below forms a
part of this declaration.



Country   Application Number   Date of filing        Date of issue        Priority Claimed
                               (day, month, yr)      (day, month, yr)

JAPAN     8-319516             29 November 1996                           [X] YES  [ ] NO
JAPAN     9-253434             18 September 1997                          [X] YES  [ ] NO

ADDITIONAL STATEMENTS FOR
DIVISIONAL, CONTINUATION OR CONTINUATION-IN-PART

I hereby claim the benefit under Title 35, United States Code § 120 of any United States application(s) listed 
below. 



Application Serial No.     Filing Date     Status (patented, pending, abandoned)

Application Serial No.     Filing Date     Status (patented, pending, abandoned)

[ ] In this continuation-in-part application, insofar as the subject matter of any of the claims of this 
application is not disclosed in the prior United States application in the manner provided by the first paragraph 
of Title 35, United States Code, § 112, I acknowledge the duty to disclose material information as defined in 
Title 37, Code of Federal Regulations, § 1.56(a) which occurred between the filing date of the prior application 
and the national or PCT international filing date of this application. 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on
information and belief are believed to be true; and further that these statements were made with the knowledge
that willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section
1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the validity of
the application or any patent issued thereon.

I hereby appoint the following attorneys and/or agents with full power of substitution and revocation, to
prosecute this application, to receive the patent, and to transact all business in the Patent and Trademark Office
connected therewith: Jerome G. Lee (Reg. No. 16,967), John D. Foley (Reg. No. 16,836), John A. Diaz
(Reg. No. 19,550), Thomas P. Dowling (Reg. No. 19,221), John C. Vassil (Reg. No. 19,098), Warren H.
Rotert (Reg. No. 19,659), Alfred P. Ewert (Reg. No. 19,887), David H. Pfeffer, P.C. (Reg. No. 19,825),
Harry C. Marcus (Reg. No. 22,390), Robert E. Paulson (Reg. No. 21,046), Stephen R. Smith (Reg. No.
22,615), Kurt E. Richter (Reg. No. 24,052), J. Robert Dailey (Reg. No. 27,434), Eugene Moroz (Reg. No.
25,237), John F. Sweeney (Reg. No. 27,471), Arnold I. Rady (Reg. No. 26,601), Christopher A. Hughes
(Reg. No. 26,914), William S. Feiler (Reg. No. 26,728), Joseph A. Calvaruso (Reg. No. 28,287), James W.
Gould (Reg. No. 28,859), Richard C. Komson (Reg. No. 27,913), Israel Blum (Reg. No. 26,710),
Bartholomew Verdirame (Reg. No. 28,483), Maria C. H. Lin (Reg. No. 29,323), Joseph A. DeGirolamo (Reg.
No. 28,595), and Christopher E. Chalsen (Reg. No. 30,936) of Morgan & Finnegan whose address is: 345
Park Avenue, New York, New York 10154.

[X] I hereby authorize the U.S. attorneys and/or agents named hereinabove to accept and follow
instructions from ________ as to any action to be taken in the U.S. Patent and Trademark Office
regarding this application without direct communication between the U.S. attorneys and/or agents and
me. In the event of a change in the person(s) from whom instructions may be taken I will so notify the
U.S. attorneys and/or agents named hereinabove.

I hereby specify the following as the correspondence address to which all communications about this application
are to be directed:



SEND CORRESPONDENCE TO: 

MORGAN & FINNEGAN, 345 Park Avenue, New York, New York 10154

DIRECT TELEPHONE CALLS TO: 

(212) 758-4800 



Full name of sole or first inventor  AKIHIRO KOHNO

Inventor's signature*  [signature]    date  [illegible]

Residence  15-18, Hiratsuka 2-chome, Shinagawa-ku, Tokyo, Japan

Citizenship  JAPAN

Post Office Address  c/o Canon Kabushiki Kaisha, 30-2, Shimomaruko 3-chome, Ohta-ku, Tokyo, Japan



Full name of second joint inventor, if any

Inventor's signature*                  date

Residence

Citizenship

Post Office Address



[ ] ATTACHED IS ADDED PAGE TO COMBINED DECLARATION AND POWER OF ATTORNEY 

FOR SIGNATURE BY THIRD AND SUBSEQUENT INVENTORS FORM. 



* Before signing this declaration, each person signing must: 

1. Review the declaration and verify the correctness of all information therein; and 

2. Review the specification and the claims, including any amendments made to the claims. 
After the declaration is signed, the specification and claims are not to be altered.

To the inventor(s):

The following are cited in or pertinent to the declaration attached to the accompanying application:

Title 37, Code of Federal Regulations, § 1.56



Duty of disclosure.... 

(a) A duty of candor and good faith toward the Patent and Trademark Office rests on the inventor,
on each attorney or agent who prepares or prosecutes the application and on every other individual who is
substantively involved in the preparation or prosecution of the application and who is associated with the
inventor, with the assignee or with anyone to whom there is an obligation to assign the application. All such
individuals have a duty to disclose to the Office information they are aware of which is material to the
examination of the application. Such information is material where there is a substantial likelihood that a
reasonable examiner would consider it important in deciding whether to allow the application to issue as a
patent. The duty is commensurate with the degree of involvement in the preparation or prosecution of the
application.

* * * *

(c) Any application may be stricken from the files if:

(1) An oath or declaration ... is signed in blank;

(2) An oath or declaration ... is signed without review thereof by the person making the
oath or declaration;

(3) An oath or declaration ... is signed without review of the specification, including the
claims ...;

or

(4) The application papers filed in the Office are altered after the signing of an oath or
declaration ... referring to those application papers.

Title 35, U.S. Code § 119

Benefit of earlier filing date in foreign country; right of priority

An application for patent for an invention filed in this country by any person who has, or whose legal
representatives or assigns have, previously regularly filed an application for a patent for the same invention in a
foreign country which affords similar privileges in the case of applications filed in the United States or to
citizens of the United States, shall have the same effect as the same application would have if filed in this
country on the date on which the application for patent for the same invention was first filed in such foreign
country, if the application in this country is filed within twelve months from the earliest date on which such
foreign application was filed; but no patent shall be granted on any application for patent for an invention which
had been patented or described in a printed publication in any country more than one year before the date of the
actual filing of the application in this country, or which had been in public use or on sale in this country more
than one year prior to such filing.

Title 35, U.S. Code § 120

Benefit of earlier filing date in the United States

An application for patent for an invention disclosed in the manner provided by the first paragraph of
section 112 of this title in an application previously filed in the United States, or as provided by section 363 of
this title, which is filed by an inventor or inventors named in the previously filed application shall have the
same effect, as to such invention, as though filed on the date of the prior application, if filed before the
patenting or abandonment of or termination of proceedings on the first application or an application similarly
entitled to the benefit of the filing date of the first application and if it contains or is amended to contain a
specific reference to the earlier filed application.

Title 35, U.S. Code § 101

Inventions patentable 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and 
requirements of this title.

Title 35, U.S. Code § 102

Conditions for patentability; novelty and loss of right to patent

A person shall be entitled to a patent unless -

(a) the invention was known or used by others in this country, or patented or described in a
printed publication in this country, more than one year prior to the date of the application for patent in the
United States, or

(b) the invention was patented or described in a printed publication in this or a foreign country or in
public use or on sale in this country, more than one year prior to the date of application for patent in the United
States, or

(c) he has abandoned the invention, or

(d) the invention was first patented or caused to be patented, or was the subject of an inventor's
certificate, by the applicant or his legal representatives or
assigns in a foreign country prior to the date of the application for patent in this country on an application for
patent or inventor's certificate filed more than twelve months before the filing of the application in the United
States, or

(e) the invention was described in a patent granted on an application for patent by another filed in
the United States before the invention thereof by the applicant for patent, or on an international application by
another who has fulfilled the requirements of paragraphs (1), (2), and (4) of section 371(c) of this title before
the invention thereof by the applicant for patent, or

(f) he did not himself invent the subject matter sought to be patented, or

(g) before the applicant's invention thereof the invention was made in this country by another who
had not abandoned, suppressed, or concealed it. In determining priority of invention there shall be considered
not only the respective dates of conception and reduction to practice of the invention, but also the reasonable
diligence of one who was first to conceive and last to reduce to practice, from a time prior to conception by the
other ...

Title 35, U.S. Code § 103

Conditions for patentability; non-obvious subject matter 

A patent may not be obtained though the invention is not identically disclosed or described as set forth 
in section 102 of this title, if the differences between the subject matter sought to be patented and the prior art 
are such that the subject matter as a whole would have been obvious at the time the invention was made to a 
person having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived 
by the manner in which the invention was made. 

Subject matter developed by another person, which qualifies as prior art only under subsection (f) or 
(g) of section 102 of this title, shall not preclude patentability under this section where the subject matter and 
the claimed invention were, at the time the invention was made, owned by the same person or subject to an 
obligation of assignment to the same person. 

Title 35, U.S. Code § 112 (in part)

Specification

The specification shall contain a written description of the invention, and of the manner and process of
making and using it, in such full, clear, concise and exact terms as to enable any person skilled in the art to
which it pertains, or with which it is most nearly connected, to make and use the same, and shall set forth the
best mode contemplated by the inventor of carrying out his invention.

Please read carefully before signing the Declaration attached to the accompanying Application.
If you have any questions, please contact Morgan & Finnegan.

Rev. 2791 M&F